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THE DOMINATING COLOUR OF AN INFINITE POLYA URN MODEL 


ERIK THORNBLAD 


ABSTRACT. We study a Polya-type urn model defined as follows. Start at time 0 with a single 
ball of some colour. Then, at each time n > 1, choose a ball from the urn uniformly at random. 
With probability 1/2 < p < 1, return the ball to the urn along with another ball of the same 
colour. With probability 1 — p, recolour the ball to a new colour and then return it to the urn. 
This is equivalent to the supercritical case of a random graph model studied by Backhausz and 
Mori 1313 and Thornblad (HI- We prove that, with probability 1, there is a dominating colour, 
in the sense that, after some random but finite time, there is a colour that always has the most 
number of balls. A crucial part of the proof is the analysis of an urn model with two colours, in 
which the observed ball is returned to the urn along with another ball of the same colour with 
probability p, and removed with probability 1 — p. Our results here generalise a classical result 
about the Polya urn model (which corresponds to p = 1). 

Keywords: urn model, largest colour, random graphs, persistent hub. 
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1. Introduction 

We study an urn model described as follows. At time 0, start with a single ball of some colour. 
At each time step n > 1, choose a ball uniformly at random. 

(1) With probability p, return the ball to the urn along with another ball of the same colour. 

(2) With probability 1 — p, recolour the ball with a new colour and then return it to the urn. 

In this paper we will typically consider the case p > 1/2, although the definition or the urn 
model makes sense for any 0 < p < 1. This urn model has a (countably) infinite number 
of colours. It also allows for the extinction of colours. If the last ball of a certain colour is 
recoloured, then this colour will never appear again in the urn. It is equivalent to the following 
random graph model studied in 015], [321 ■ Let G$ be the graph with a single isolated vertex. 
Create G n from G n _i by doing one of the following steps. 

(1) With probability p, do a duplication step. Select a clique in G n -\ with probability 
proportional to size, and introduce a new vertex to that clique. 

(2) With probability 1 — p, do a deletion step. Select a clique in G n - 1 with probability 
proportional to size and delete a vertex from the chosen clique. Then introduce a new 
clique with a single vertex. 

The equivalence of these models is clear once we identify each clique with a colour in the urn. 

Let us mention a few known results about the urn model, coming from |(4j|5][FZ]]. These results 
were originally proved in the random graph version, but transfer immediately to the urn version. 
The degree distribution is known to exhibit a phase transition from exponential decay to power 
law in the three regimes 0 < p < 1/2, p = 1/2 and 1/2 < p < 1, referred to as the subcritical, 
critical and supercritical case, respectively. Knowledge of the degree distribution of the random 
graph model translates to knowing the almost sure limits of the quantities U ^ n , where Uj_ n is 
the number of colours at time n with j balls, and N n is the number of balls at time n. This was 
done for p = 1/2 in 0 and for the remaining cases in ffTTl . Both exact and asymptotic results 
were found. Later Backhausz and Mori 0 revisited the model and determined bounds on the 
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logarithmic growth rate of the maximal clique size of the graph, i.e. the number of balls of the 
leading colour. In particular, in the supercritical case p > 1/2 they found that 


( 1 ) 


^ < liminf 

p n —>-oo 


log Mn 
log N n 


< liminf 

n—>oo 


log Mn 
log N n 



where ft = 2 / J { , M n is the size of the leading colour at time n, and N n is the number of 
balls in the urn at time n. As we shall see later, this result can be strengthened to show that 
M n ~ for some random variable // > 0, implying that the upper bound in ([[} is the 

correct one. This was indeed the correct growth rate conjectured in @. We remark that this 
result was originally put in terms of the maximal degree, which is equal to one less than the 
maximal clique size, which by the identification is equal to the number of balls of the leading 
colour. 

Similar studies have been done for population models. Champagnat and Lambert (6l studied 
a population model in which individuals were given i.i.d. lifetime distributions and give birth at 
constant rate. Furthermore, individuals then mutate at constant rate and change allelic type. If 
the lifetime distribution has a unit point mass as oo, births are at rate p and mutations at rate 1 —p, 
then this corresponds to a continuous-time version of our model, where colours correspond to 
the alleles. Champagnat and Lambert achieved a number of convergence results, in all three 
regimes, about the oldest and most abundant families, mainly in expectation and distribution. 
By constrast, we shall derive almost sure results, but only in the supercritical regime. 

One of our main results is that, provided p > 1/2, then, with probability 1, there is some 
colour that after some random but finite time becomes dominant, i.e. remains the colour with 
the most number of balls forever. A similar problem was studied by Khanin and Khanin I T2I 1. 
They consider an urn model with k colours, with a parameter r > 0. Balls are added sequen¬ 
tially, and the probability of adding a ball of colour 1 is proportional to x r , where x is the 
number of balls of colour 1, etc. They show that for r > 1/2 one of the colours will be even¬ 
tually dominant almost surely, but for r < 1/2 the colours change leadership infinitely many 
times. Indeed, for r > 1, all but finitely many of the new balls are of the same colour, creating a 
monopoly of one of the colours. Similar results were achieved by Chung, Handjani and Jungreis 
171 for an infinite urn model. This model works as follows (slightly rephrased to allow for more 
direct comparison to our model). With probability p, add a ball, the colour of which is chosen 
like in the Khanin/Khanin-model. With probability 1 — p, add a ball of a new colour. The 
difference to our urn model is that colours can never lose balls in the Chung/Handjani/Jungreis- 
model. Also, the number of balls in the Chung/Handjani/Jungreis-model grows deterministi¬ 
cally, which makes analysis slightly easier. Although these models appear similar, qualititively 
they behave rather differently. For r = 1 it is found that the number of colours of size k in the 
Chung/Handjani/Jungreis-model decays like a power-law for all 0 < p < 1. This should be 
contrasted with our model, where there is a phase transition from exponential decay to power 
law at p = 1/2. 

In the random graph interpretation of our model, the notion of a dominant colour should 
be seen as a variation of the notion of a persistent hub, a concept considered by Dereich and 
Morters lfl4l and by Galashin (5). The latter considered the classical preferential attachment 
model and a variation thereof and showed that, almost surely, there is a vertex that after some 
random but finite time (and always thereafter) is the vertex of maximal degree in the graph. This 
vertex is called the persistent hub. As remarked in [5|, our random graph model does not have 
a persistent hub on the level of vertices, since all vertices will be selected for deletion infinitely 
often, thus pushing their degree down to 0. However, by using the correspondence between 
vertices in a clique and balls of a certain colour, we will see that there is a clique that almost 
surely is the largest one, i.e. a persistent clique-hub. 

The rest of this paper is outlined as follows. We will typically embed the discrete urn model 
in a corresponding continuous-time model. First we use the contraction method to determine 
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the growth rate of a fixed colour. Then, to show that there is an eventually dominating colour, 
we follow the approach taken by Galashin JS]. His methods extrapolate to our setting without 
any significant changes. We start by observing the joint behaviour of two fixed colours and 
determine exactly the asymptotic distribution of the quantity w B f B , where B n and W n are the 
number of balls of the two colours at time n, respectively. From this distribution we deduce that 
two colours can change relative leadership at most finitely many times. Furthermore, we can 
determine exactly the probability that one of the colours ever overtakes the other, which allows 
us to bound the probability that a colour with only one ball (a new colour) will overtake a colour 
with many balls (the leading colour). This probability will turn out to be sufficiently small, in the 
sense that we can apply the Borel-Cantelli lemma to show that colours with few balls overtake 
the currently leading colour only a finite number of times, with probability 1. This, along with 
the fact that two fixed colours overtake each other at most a finite number of times, implies the 
existence of a dominating colour. This dominating colour must grow like some fixed colour, so 
we are able to determine the asymptotic growth rate of the largest colour of the urn. 

In line with 10133, let us introduce the notation (5 = [ and 7 = Yji. We use the notation 

a n b n for (possibly random) sequences (a rt )/L, and (b n )™ =1 to mean lim.,^ >oc 1 and 

X ~ F to denote that the random variable X has distribution F. 


2. Results 


Inspired by Athreya’s embedding scheme, see |j3] V.9], we shall embed the discrete urn 
scheme in a continuous-time urn scheme. The continuous urn scheme is defined as follow. 
Each ball in the urn has two exponential clocks, one ringing at rate 1/2 < p < 1 and the other 
at rate 1 — p. If the first clock rings, add to the urn another ball of the same colour. If the second 
clock rings, remove the ball from the urn and add a ball of a new colour. It is easy to see that 
the discrete process has the same transition probabilities as the continuous process. Moreover, 
whenever a new colour is created, it behaves like a birth-death process with birth rate p and 
death rate 1 — p. We characterise the growth rate of such a birth-death process in the next 
lemma. 

Lemma 2.1. Let (X(t))t>o be a continuous-time birth-death process with birth rates p and 
death rates 1 — p. Then 


X(t) a.s. 
e (2p-l)(t) 


( 1 >?) +7<*o. 


where U is a random variable with distribution (1 — 7 ) T I 1 


Note that So denotes the distribution that places unit mass at 0. The quantity 7 is the extinction 
probability (which can be found in many different ways). After some work, one can show that 
this follows from the results in ||3] III.5]. We instead use the contraction method, which exploits 
the recursive structure of the process. We sketch the argument here, and refer the reader to 
□ana for further details and references. 

Proof. Let r be the first event time in the process X(t). With probability p, this event is a birth, 
and with probability 1 — p, it is a death (which then forces X(t) = 0 for all t > r). This leads 
to the distributional equality 


X{t) i l { r<t}Y (xft - r) + x"{t - r)) + l {T>t} 


where X(t),X'(t) and X"(t) are independent and identically distributed, Y ~ Ber(p) and 
r ~ Exp(l). Normalising we obtain 



e (2p-l)(i-r) e (2p—l)(t— t) 


X’jt-T) | X"{t~T) 
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Note that l{ r >t} —> 0 as t oo; similarly IfrCt} —> 1 as t —> oo. It can be shown that 
X (t) / e^ 2p ~ l ' ,t is a non-negative martingale, so by the martingale convergence theorem it con¬ 
verges almost surely to some random variable U, which then must satisfy the distributional 
equality 

(2) U = e -Qp-V T Y (U' + U") , 

where U, U', U" are independent and identically distributed, Y ~ Ber(p) and t Exp(l). 

Consider the space M. of distributions with finite second moment and first moment equal to 
1, equipped with the Wasserstein metric 

d( Ai, A 2 ) = inf ||Zi — Z 2 W 2 


where the infimum is over all random variables Z\ and Z2 with Z\ ~ Ai and Z-) ~ A 2 
and || • ||2 denotes the /, 2 -norm. Let T : M —> M be the distributional operator TZ = 
e -(2p-i ) t y (Z' + Z"), with r, Y independent and like above, and Z', Z" independent and dis¬ 
tributed like Z. Note in particular that TZ E M for any Z E M, so this operator is well- 

defined. It can be shown that E[2e _ 2 ( 2 p_ 1 ) T Y 2 ] = < 1 is a contracting factor in the 

Wasserstein metric for the operator T. By the Banach fixed point theorem, it follows that the 
([2]) has a unique solution in Ad, see e.g. lfT6l Theorem 2.2]. Moreover, it can be verified that 
the distribution (1 — 7 )T( 1,1 /(3) + 7 ^ lies in Ad and satisfies <0 so this must be the unique 
solution. It suffices now to show that the distribution of U lies in Ad, i.e. that E[C7] = 1 and 
E[[/ 2 ] < 00 . It follows from f3] III.4-5] that 


Var(X(f)) 


e 2(2p—i)t^q — e - ( 2p-1 ^) 
2 p- 1 


which implies that Var (X(t)/e^ 2p l ' ,t ) < f° r a H t > 0 and in particular that the second 
moment of the limiting random variable U is finite. Therefore the first moment of U is also finite, 
and it follows that E [U] = 1 since U is the limit of a martingale sequence with expectation one. 
Therefore the distribution of the limiting random variable must be in Ad, so the contraction 
method shows that U (1 — 7 )r(i, 1/ p ) + 7^0- 

□ 


Using this we can relate the growth rate of a fixed colour to the growth rate of the entire urn. 

Theorem 2.2. Let X n be the size of a fixed colour at time n. Conditional on survival of this 
colour, there exists a positive random variable v such that 

X n vNW. 

Proof We make use of the continuous-time embedding described earlier. For this purpose, 
let X (t) be the number of balls of a fixed colour at time t in the corresponding continuous 
model. Suppose this colour was born at time to- By Lemma |2. 1 1 conditional on survival, we 
have X(t ) c ~' e ( ' 2p ~ 1 ^ t ~ t °' ) U, where U ~ T(l, 1 /fi). Denoting by N(t) the total number of 
balls by time t, similarly one can show that N(t) a ~' e pt V where V ~ T(l, 1). This is done in 

2p—l 

151 . Therefore, conditional on survival of the fixed colour, we have X(t) ~ vN{t) <• almost 
surely, where v is a positive random variable (that depends on to, U and V). 

Now, let T n be the n:th birth or death of the process (X(t))t> o- Observing the values 
(X(T n ))™ =l gives us the discrete urn process, so it suffices to show that T n 00 as n —>• 00 . 
This is well-known if p = 1, see e.g. (3l III], and is equivalent to showing that the process does 
not explode in finite time. But we can couple our birth-death process X (t) for 1/2 < p < 1 to a 
pure birth process with p = 1, in such a way that it grows slower. Hence it also cannot explode 
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in finite time, so T n —> oo on the event of non-extinction. Recall now that 2p p 1 = A. 

x n w V N}jr 


Then 

□ 


As mentioned, a colour survives with probability 1 — 7 and goes extinct with probability 7, 
so this gives us the following corollary. 


Corollary 2.3. Let X n be the size of a fixed colour at time n. Then there exists a positive 
random variable v > 0 such that 

a.s. JO, with probability 7, 

s 1//3 

I z/iV n , with probability 1 — 7. 


Let us now turn to the joint behaviour of two fixed colours. To do this, we consider the 
projection of the entire urn onto two colours. That is, we study an urn model with balls of two 
colours, black and white say. Initially there are Bq = b black balls and Wq = w white balls. 
Sequentially sample balls from the urn, uniformly at random. With probability 1/2 < p < 1, 
return the ball to the urn with a ball of the same colour, and with probability 1 — p remove the 
urn from the urn with negative probability. To avoid degeneracies we stop the evolution of the 
urn if one of the colours disappear from the urn. Conditional on B n f 0, W n f 0, the transition 
probabilities are given by 

with probability Pjjpfwy 
with probability P B ^ Wn 
with probability (1 - p) B ^ n 
with probability (1 - p) jjywwp 

A similar urn model was studied in f9|, where the urn came with an additional immigration 
procedure to ensure that both colours survive. We mention the paper ifTOl . that to a large extent 
solved the case of so-called tenable urn models. Our urn model does not fall into this category 
(in particular the condition of irreducibility is not satisfied, i.e. that any configuration of the urn 
can be reached from any starting configuration). However, the process ( B n , Wnf^L 0 can be seen 
as a triangular urn scheme with random replacement matrix, allowing for non-negative entries. 
Triangular urn schemes with deterministic replacement matrix were considered by Janson lUTll . 
the results of which were extended by Aguech fT| to random triangular replacement matrices. 
The latter however imposed that the random variables be non-negative, to ensure the survival 
of the urn. Our urn model does not have non-negative entries; however, the condition p > 1/2 
implies that the replacement distribution has positive expectation, so the colours (and the urn) 
survive with positive probability. 

We are interested in the joint behaviour of two colours, or more precisely the proportion 


(-®7l+l ) W/l+l) — * 


(B n + 1, W n ) 
( B n , W n + 1) 
(B n — 1, W n ) 
( B n , W n - 1) 


fb,w( n ) 


B r 


W n + B 


n 


As mentioned, we stop whenever fb,w(n) = 0 or (n) = 1, so these are absorbing states of 
the process (fb,w(n))^ = 0 . Alternatively, instead of stopping the process here, we could condi¬ 
tion on being on the event of non-extinction, i.e. that not all balls disappear from the urn. The 
probability of this event, which is positive, will appear implicitly later on. 

We shall again use the continuous time embedding to evaluate the evolution of the urn. That 
is, we consider indepndent black and white birth-death process {Bi{t)) b i=l and 
started at Bi{ 0) = W l (0) = 1, with birth rates p and death rates 1 — p. It is again easy to 
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show that the discrete process (B n , W n )^L 0 has the same transition probabilities as the continu¬ 
ous process , so we will study the quantity 


9b,w(t) 


Eti Bj(t) 

EUm+j:r=im) 


instead of its discrete analogue f b , w (n). Again we stop the process if we ever reach gb,w{t) = 0 
or 9b,w (f) = 1. Since the processes are equivalent, we will be able to show that almost sure 
convergence of g b , w (t) implies almost sure convergence of ft KW (n) to the same limit. 


Proposition 2.4. The limit lini/^oo gi I V! (t ) exists almost surely, and its distribution is given by 
the mixture 


n, w S o + r b>w Beta(G b , H w ) + r w>b S i, 

where r b)W = y b (l - 5^7™), r W)b = 7“ (l - ^ 7 b )> r* b w = {l-y b )(l-y w ), and G bl H w 
are independent discrete random variables with probability mass functions 

n*[G„ = fc] = ^ Q U - 7)V-‘, k = 1,.... 6, 

«■[«» = fc] = ^ (”) (1 - 7) VA k = i,...,w, 

i.e. binomial random variables conditioned on not being zero. 

Proof. Let T b = inf {t : Y!i=iBi(t) = 0 } and 07, = infjf : Wj(i) = 0 } be the 

extinction times of the two processes. The event that the white process dies out is {t w < 00}, 
and similar for the black process. Note in particular that these events arc independent. 

On the event {r b < o w < 00} U {77 < a w = 00} we have g b , w (r b ) = 0 . The first event 
occurs with probability jf^ b 'j b+w , since the probability that one of the w white processes is 
the last one to die is (conditional on all b + w processes dying), by symmetry, and both 
processes die with probability y b+w , by independence. The second event occurs with probability 
7 b (l — 7™), and the sum of these probabilities is r bfW . By symmetry, the probability of the event 
Ww < n < 00} U {r b < a w = 00} is r w , b . 

It is clear that g b , w {T b ) = 0 on {r b < cr w < 00} U {77 < o w = 00}. Since 0 is an 
absorbing state, this gives a point mass at 0 with relative mass r b , w • Similarly g b , w (cr w ) = 1 on 
{a w < 17 < 00} U {77 < a w = 00}, giving a point mass at 1 with relative mass r w>b . 

The event that neither process dies out (so at least one white and at least one black process 
survives) is the event {77 = 00, er,„ = 00}. By independence this has probability 

P[t7 = 00, o w = 00] = (1 - 7 6 )(1 - 7“) =: r* b w . 


Let G b , H w be as in the statement. On the event {77, = 00, a w = 00} we have at least one 
surviving process of each kind. Conditional on this, since each black process survives with prob¬ 
ability 1 — 7 independently of the other, the number of surviving black processes is distributed 
like G b . Similarly the number of surviving white processes on this event is distributed like 
H w . Recall the scaling limit in Lemma 2.1 which holds for each surviving process separately. 
Therefore, on this event, 


9b,w(f ) — 




U 


e —(2p—i)t E g ll B h (t) + e-Pi-i* W fc (t) 

as t —> 00, where U ~ T(Gb, 1//3),V ~ T(H W , 1 // 3 ). But then jj^y ~ 


U + V ’ 

Beta (G b ,H w ). 


□ 


Proposition |2,4| immediately transfers to the discrete-time urn model. 
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1.000 2,000 3,000 4,000 5,000 

# black balls 


FlG. 1. The left-most figure is a simulation of a single instance of the urn 
model with (6, w ) = (2, 3) and p = 3/5, with 300 drawings. The second one 
is a simulation of 30 trials with 20 000 drawings each, and parameters ( b , w) = 
(2, 3) and p = 3/4. In particular, note that diagonal-crossing and absorption 
tends to happen early, if at all. The simulations were done in MatLab. 


Theorem 2.5. With the same notation as in Proposition 
surely, with distribution given by the mixture 


2.4 


lim n 


fb,w (ti) exists almost 


n, w 5 o + rl ;w Beta(G b , H w ) + r wfi 5i. 


Proof. Let T n be the time of the n:th birth or death in the process (f2!j= i Y17= l ^4 W j ’ 

conditional on survival. Since f b)W (n ) = g n ,w(T n ), it suffices to show that T n oo as 
n —> oo. The proof of this is similar to the argument in the proof of Lemma [2TT| □ 

It is natural to extend the above results to k colours rather than 2. This corresponds to pro¬ 
jecting our infinite urn model onto k fixed colours. Namely, consider an urn with k colours, and 
let Xi(n ) denote the number of balls of colour i at time n, with initial condition XfO) > 0. 
At each time step, draw a ball uniformly at random. With probability p, replace it to the urn 
along with a ball of the same colour. With probability 1 — p, remove it from the urn. Let 
S(ri) = Xi(n) + • • • Xk(n). With the same method as above, it is not difficult to show, on the 
event of non-extinction of the urn, that 

(Xi(n)/S(ra),..., X k (n)/S(n)) A (Y u ..., Y k ) 


where each Y* is distributed as a mixture of Dirichlet distributions (possibly degenerate to sub¬ 
spaces), the parameters of which will be binomially distributed conditioned on not all being 
zero. We leave the details for the reader. 

We remark that p = 1 gives us the original Polya urn model. Indeed, for p = 1 we have 
7 = 0, and it is readily checked that the limit in Theorem 2.5 reduces to Bcta(6. w), which is 
a classical result in the literature. We also point out that all birth-death processes go extinct 
almost surely in the case p < 1/2, so we would have convergence of g b ,w(t) and f b , w (n) to a 
convex combination of two point masses at 0 and 1. Flowever, even such urns have been studied 
in the literature, see e.g. Ifl3ll which studies a class of diminishing urn processes. For instance, 
our urn model with p = 0 may be seen as sampling without replacement. 

Let us return to the supercritical case p > 1/2 and concentrate on the event that there is an 
equal number of black and white balls in the urn. It is easy to show that this occurs at most 
finitely many times. 


Corollary 2.6. With probability 1, the event B n = W n occurs for at most finitely many n. 
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Proof. For all possible realisations of G b , H w , the distribution Beta(G},, H w ) is absolutely con¬ 
tinuous on (0,1), so the mixture in Theorem |2. 5 1 is also absolutely continuous on (0,1). There¬ 
fore n - ll f B converges almost surely to some constant c / 1/2. But this means that the fraction 
equals 1/2 at most a finite number of times. □ 


Knowing that B n = W n for at most finitely many n, it is natural to ask what the probability 
is that this event occurs at all. Galashin 0 and Antal, Ben-Naim and Krapivsky |2l considered 
this problem for p = 1, i.e. the classical Polya urn model. They view the process ( B n . W n ) as a 
random walk on Z 2 . For p = 1 this process can only go right or up, so it is possible to count the 
number of lattice paths from a given starting point to a fixed point on the diagonal. Using the 
fact that these paths are exchangeable and summing over all points on the diagonal, it is possible 
to bound the probability that the process ever hits the diagonal. For p < 1 there are infinitely 
many paths to any point on the diagonal, so this approach is not possible here, but an argument 
given by Wallstrom lfT8l for the case p = 1 can easily be adapted to our setting. 


Proposition 2.7. Suppose b > w. Using the same notation as in Proposition 2.4 let F bw he the 
distribution function of the mixture r btW 5o + r £ w Beta(G bl H w ) + r wb 8\. Let P(b, w) denote the 
probability that B ^ for some n. Then 


P(b,w) = 2F biW (l/2). 


Proof. Let p be the random limit of f b , w ( n ) = B B jf w , and let £ = inf{n > 0 : f b , w ( n ) = |}- 
Since P[</j = 1/2] = 0, we have {8 < oo} = {8 < oo,ip > 1/2} U {8 < oo,<p < 1/2}. 
The process {B n ,W n )^ =£+1 is Markovian, so it depends only on {B^. Ws). Thus, on the 
event 8 < oo, the limit ip is chosen according to F Bs>B£ , which is symmetrical around 1 / 2 . 
Therefore P[£ < oo,p < 1/2] = P[£ < oo,p > 1/2]. But since b > w we have that 
{p < 1/2} C {8 < oo}, so P [8 < oo, tp < 1/2] = P[</? < 1/2]. Thus 


P(b,w) = P [8 < oo] = 2¥[<p < 1/2] = 2F btW (l/2). 


□ 


Later on it will be useful to have a bound on the probability P(b, 1) for large b. This will 
gives us the probability that a new colour (which starts with a single ball) in the infinite-colour 
urn model ever catches up with an old colour. In the next lemma we show that this probability 
decreases at least exponentially in b. 

Lemma 2.8. The equalisation probability P(b. 1) satisfies the inequality 


P{b, 1 ) < 2 (^-*- 


b 
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Proof. By Proposition [277] we have 

P(M) = 2 *m( 1 / 2 ) 


= V 1 - 


b+1 


7 


+ 2(1 - 7 6 ) (1 - 7 ) J2 k f Q ' 2 xk ~\ l ~ Q (1 7) " 7< 


k.b—k 


1 — 7 fe 


S27 + 2^-LP)(l- 7 )V 


fc=l 


k n h—k 




k =0 


2 1^+7 


2 I — 

,2 p 


□ 


We now finally return to the urn model with an infinite number of colours. The bound on 
P(b , 1) gives us good enough control over the probability that a small colour ever overtakes a 
large colour. Using this bound and the Borel-Cantelli lemma, we show now that there can have 
been at most a finite number of colours that were ever the leader. The proof relies on Lemma 


Proposition 2.9. Almost surely there can have been at most finitely many leading colours. 

Proof. Let (T n )™ =1 be the birth times of new colours. Let PL n be the event that the colour born 
at time T n ever becomes as large as the leading colour at time T n . The joint behaviour of the 
sizes of a new colour and the currently leading colour is described by the 2 -dimensional urn 
model above started from (M n , 1). Now, for any r G N, we let C r be the event that M n > n 1 / 2 ^ 
for all n > r. Note that this implies that M n > r 1 / 2 d n 1 / 2 h for all n > 1. For each fixed r, we 
have 

, i N r -l/2/3fo/2/3 

IP [Hn n c r \ < sup P(A, 1) < 2 ( — ) 

A > r -l/2p n l/(2/}) \ Z V J 


2.8 and the growth rate in Lemma [27T 


oo o° / i v r*— 1 / 2 > 3 ri 1 / 2 / 3 

y! ^\Hn n c r \ < 2 ^ ) < 00 , 

n =1 n= 1 k P/ 

where the sum converges by e.g. integral comparison. The Borel-Cantelli lemma implies that 
Tin Cl Cr occurs for infinitely many n with probability 0. Recall Theorem |2.2[ i.e. that the 
number of balls of any fixed colour, conditional on survival, grows like vNn ^, where v is some 
positive random variable. Therefore P[C r ] ^ 1 as r oc, which implies that PL n occurs for 
infinitely many n with probability 0. Therefore, with probability 1, only finitely many colours 
can have been leaders. □ 

Theorem 2.10. Almost surely there is a dominating colour. 

Proof. By Proposition |2.9| there can have at most a finite number of colours of maximal size, 
and by Corollary |2.6| these can have changed leader at most finitely many times. Therefore, with 
probability 1 , there is a colour that is always the largest after some random but finite time. □ 
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In the corresponding random graph model, this says that almost surely there is a persistent 
clique-hub. Recall that M n denotes the size of the leading colour at time n and N n the total 
number of balls in the urn at time n. Since there almost surely is a dominating colour and we 
know the growth rate of any fixed colour by Theorem |2.2| (in particular that of the dominating 
colour), we obtain the following theorem. 

Theorem 2.11. There is a random variable // > 0 such that 

M n uNW. 

By taking logarithms, this also implies the strengthening of the result of Backhausz and Mori 
mentioned in the introduction. 
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